Multilingual Database Systems
نویسنده
چکیده
Companies and organizations often need to publish clients' information to institutions for research purposes. For example, a hospital periodically releases patients' diagnostic records so that medical scientists can study the correlation between diseases and various factors. Privacy preservation is an important topic in data publication. First, the publication should be fuzzy enough to disallow any adversary to figure out the exact medical history of any patient. On the other hand, the released data must be sufficiently precise to enable effective analysis. In this tutorial, we will review the existing techniques for striking an appropriate balance, in order to maximize the accuracy of data investigation, without breaching any patient's privacy. Speaker’s Profile: Yufei Tao is the winner of the Hong Kong Young Scientist Award 2002, conferred by the Hong Kong Institution of Science. He holds a PhD degree in computer science from the Hong Kong University of Science and Technology, and did his post-doc as a visiting scientist in the Computer Science department of the Carnegie Mellon University, from 2002 to 2003. In the next three years, he was an assistant professor at the City University of Hong Kong. Currently he is an assistant professor at the Department of Computer Science and Engineering, the Chinese University of Hong Kong. Prof. Tao is engaged in research of database systems. His research interests include temporal databases, spatial databases, approximate query processing, data privacy and security. He has published extensively in renowned conferences and journals including ACM SIGMOD, VLDB, IEEE ICDE, ACM TODS, IEEE TKDE, VLDB JOURNAL, etc. (Duration: 1.5 Hours)
منابع مشابه
On Database Support for Multilingual Environments
Global e-Commerce and mass-outreach e-Govemance programs have brought into sharp focus the need for database systems to store and manipulate text data e@ ciently in a suite of natural languages. While some means of storing and querying multilingual data are pmvided by all current database systems, to the best ofour knowledge there has been no prior study of theirfunctionality or eficiency in th...
متن کاملÅùðøøððòòùùð Áòòóöññøøóò Èöó×××òò Óò Êêððøøóòòð Øøøø×× Ööööøøøøùöö×
EÆcient storage and query processing of data spanning multiple natural languages are of crucial importance in today's globalized world. A primary prerequisite to achieve this goal is that the principal data repositories, relational database systems, should eÆciently and seamlessly support multilingual data. Our survey of current relational systems indicates that while they do support storage an...
متن کاملConceptual Database Retrieval through Multilingual Thesauri
In traditional database management systems, information retrieval is often carried out using keywords contained within fields of each record. Because a term (concept) can be expressed in several ways, a significant number of records are ignored by the free text techniques which use only a posteriori relations between terms. This paper proposes the utilisation of a priori conceptual relations be...
متن کاملMulti-lingual Semantic Matching with OrdPath in Relational Systems
The volume of information in natural languages in electronic format is increasing exponentially. The demographics of users of information management systems are becoming increasingly multilingual. Together these trends create a requirement for information management systems to support processing of information in multiple natural languages seamlessly. Database systems, the backbones of informat...
متن کاملMIRA: Multilingual Information Processing on Relational Architecture
In today’s global village, it is critical that the key information tools, such as web search engines, e-Commerce portals and e-Governance, work across multiple natural languages, seamlessly. We propose a new flexible architecture – Multilingual Information processing on Relational Architecture (MIRA) – that supports the multilingual processing functionality of the primary storage mechanism for ...
متن کاملFrameworks, Implementation And Open Problems For The Collaborative Building Of A Multilingual Lexical Database
Many NLP systems are based on lexical data. The development costs of such data are a major drawback in such NLP systems. In order to cut these costs, we adopt a strategy inspired from "opensource" projects to allow volunteers to collaborate in the creation of a multilingual lexical database. For this, we had to specify and develop tools to manage a lexical database containing information comple...
متن کامل